(19) 



J 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 




(11) 



EP 0 834 809 A2 



(12) 



EUROPEAN PATENT APPLICATION 



(43) 


Date of publication: 


(51) Int CI. . ^UOi WHO, owoi J7/*t«+wi 




08.04.1998 Bulletin l99o/i5 




(21) 


Applicatbn number: 97480048.4 




(22) 


not A r\i filinn* 1Q flA IQQT 
Uai6 OT Tlilny. ls7.UO. Is99r 




(84) 


Designated Contracting States: 


(72) Inventor: Bereiter, Thomas W. 


AT BE CH DE DK ES Fl FR GB GR IE IT LI LU MC 


Austin, Texas 78703 (US) 




NL PT SE 






Designated Extension States: 


(74) Representative: Schuffenecker, Thierry 




AL LT LV RO Si 


Compagnie IBM France, 






Departement de Propriete Intellectuelle 


(30) 


Priority: 01.10.1996 US 724663 


06610 La Gaude(FR) 


(71) 


Applicant: INTERNATIONAL BUSINESS 






MACHINES CORPORATION 






Armonk, NY 10504 (US) 




(54) 


Scaleable and extensible system management architecture witli dataless endpoints 



< 

o 

00 

CO 
00 

o 

Q. 
LU 



(57) A large distributed enterprise includes comput- 
ing resources that are organized into one or more man- 
aged regions, each region being managed by a server 
machine servicing one or more gateway machines, with 
each gateway machine servicing a plurality of endpoint 
machines. A distributed system management frame- 
work is supported on the gateway machines and the one 
or more endpoint machines to carry out system man- 
agement tasks. To enhance scalability, the endpoint ma- 
chines support a low cost, low maintenance client com- 
ponent of the system management framework, and a 
corresponding server component is supported on each 
of the gateway machines. On an as-needed basis, ap- 
propriate executable code and system management da- 
ta is delivered from a gateway to one or more endpoint 
machines to facilitate execution of a system manage- 
ment task for the managed region. Typically, the system 
management data is not stored in the endpoint, and this 
"dataless" approach reduces the complexity and main- 
tenance costs associated with distributing the function- 
ality of the system management framework. The end- 
points are easily extensible to include new application 
functionality without requiring the overall framework to 
be rebuilt or reinstalled. 
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Descript -zm 

TECHNICAL FIELD 

The present invention is directed to managing a 
large distributed computer enterprise environment. 

BACKGROUND OF THE INVENTION 

Managing a computer network comprising thou- 
sands of nodes can produce serious difficulties for sys- 
tem administrators. Management tasks, such as distri- 
bution of system-wide changes, must be carried out 
quickly and in a dependable manner in order to reduce 
the probability of catastrophic failure. Distributed com- 
puting environments thiat are known in the art do not 
scale easily to large size. One of the root causes of this 
deficiency Is that prior art management environments 
include high overhead applications that, typically, are 
run on all of the managed machines in the network. In 
such systems, even machines located at the bwest lev- 
el of management functionality (so-called "endpoints") 
often include a full suite of management routines and 
systems management data. Because the endpoint ma- 
chine has its own database, it has to be backcd-up to 
accurately back up the overall environment. 

As the number of such machines gets large, the 
time to backup all the distributed databases becomes 
great and the backup storage requirements become un- 
manageable. When endpoint machines fail, or when us- 
ers accidentally remove files. It is an enormous burden 
on system administrators to have to locate and restore 
the endpoint's database, especially if the missing data- 
base prevents the overall managed architecture from 
being able to distribute to the failed endpoint. Moreover, 
adding new application functionality to an endpoint ma- 
chine typk:ally requires the overall management archi- 
tecture to be re-built, re-installed or, at least, re-initial- 
ized. This is a time-consuming, complex administrative 
task that severely limits the flexibility and increases the 
cost of system management. As a result of these prob- 
lems: it has not been possible to increase the size or 
"scalability" of such networks to a true "enterprise" level. 

The present invention addresses and solves these 
problems. 

BRIEF SUMMARY OF THE INVENTION 

It is a primary object of the invention to effectively 
manage computing resources in a large distributed en- 
terprise environment. 

It is another object of the invention to enable an en- 
terprise to place substantially all of its computing re- 
sources on a network that is managed in a reliable, cost- 
effective manner: 

It is still another object to reduce the complexity and 
cost of systems management in a large enterprise en- 
vironment by supporting a low cost, low maintenance 



management framework on the vast majority of ma- 
chines (e.g., the personal computers or PC/Es) in the 
enterprise. 

Yet another object is to enhance the scalability of a 
5 large distributed computing network by distributing the 
functionality of a system management framework in ac- 
cordance with a "client-server" paradigm. 

A more specific object of the invention is to imple- 
ment a low cost, low maintenance component of a sys- 
10 tem management framework at the endpoint machines 
that comprise the largest percentage of computing re- 
sources in the enterprise environment. 

It is another more particular object to support a min- 
imal set of applications on "dataless" endpoint machines 
IS of a large, distributed environment to facilitate systems 
management. 

It is still another object of the invention to meet the 
needs of customers with very large and geographically- 
dispersed networks and. more particularly, to signifi- 
20 cantly expand the scalability parameters of traditional 
management tools and techniques. 

Yet another importanhobject is to enable PC con- 
nectivity in a large centrally-managed network enter- 
prise. 

25 These and other objects are achieved in a large dis- 
tributed enterprise that includes computing resources 
organized into one or more managed regions, each re- 
gion being managed by a sen/er machine servicing one 
or more gateway machines, with each gateway machine 

30 servicing a plurality of endpoint machines. A system 
management framework is "distributed" on the gateway 
machines and the one or more endpoint machines to 
carry out system management tasks. To enhance scale- 
ability, the endpoint machines support a low cost, low 

35 maintenarice client component of the system manage- 
ment framework, and a corresponding server compo- 
nent is supported on each of the gateway machines. On 
an as-needed basis, system management data (and ex- 
ecutable code, if necessary) is delivered from a gateway 

40 to one or more endpoint nnachines to facilitate execution 
of a system management task for the managed region. 
Typically, the system management data is not stored in 
the endpoint, and this "dataless" approach reduces the 
complexity and maintenance costs associated with dis- 
tributing the functionality of the system management 
framework. The endpoints are easily "extensible" to in- 
clude new application functionality without requiring the 
overall framework to be rebuilt or reinslalled. 

A preferred method of executing a system manage- 

50 ment task affecting the managed region begins by de- 
livering executable code and system management data 
required for the system management task from a gate- 
way machine to one or more endpoint machines serv- 
iced by the gateway. The executable is a shell script, a 

55 _ specialized script, a compiled program or any other kind 
of valid executable. When a system management task 
is created, the executable is stored on disk, and a ref- 
erence to the disk file is stored as an attribute in an ob- 
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ject database in the gateway machine. Upon receipt of 
the executable code and system management data 
Irom the gateway machine, the client component of the 
management framework (supported on the endpoint 
machine) then carries out the management task. Pref- 
erably, data is not cached on the endpoint, and the end- 
point returns to its normally "idle" state after the task is 
completed. 

An endpoint computer connectable into such an en- 
terprise thus includes a processor, an operating system, 
a graphical user interface, and a client component of a 
system management framework, the client component 
having an associated server component supported on 
a gateway machine that services that computer. The cli- 
ent component includes means responsive to receipt of 
executables and system management data from the 
gateway machine to facilitate executbn of a system 
management task into which the computer is connect- 
ed Preferably, the system management framework is 
object-oriented. 

The foregoing has outlined some of the more perti- 
nent objects of the present invention. These objects 
should be construed to be merely illustrative of some of 
the more prominent features and applications of the in- 
vention. Many other beneficial results can bo attained 
by applying the disclosed invention in a different manner 
or modifying the invention as will be described. Accord- 
ingly, other objects and a fuller understanding of the in- 
vention may be had by referring to the following Detailed 
Description of the preferred embodiment. 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the present 
invention and the advantages thereof, reference should 
be made to the following Detailed Description taken in 
connection with the accompanying drawings in which: 

FIGURE 1 illustrates a simplified diagram showing 
- a large distributed computing enterprise environ- 
; ment in which the present invention is implemented; 
r FIGURE 2 is a block diagram of the system man- 
agement framework illustrating how the framework 
functionality is distributed across the gateway and 
its endpoints within a managed region; 
FIGURE 2 A is a block diagram of the elements that 
comprise the LCF client component of the system 
management framework; 

FIGURE 3 illustrates a smaller "workgroup" imple- 
mentation (e.g.. a local area network) of the enter- 
prise in which the server and gateway functions are 
supported on the same machine; 
FIGURE 4 shows a simplified representation of how 
a system administrator implements a system man- 
agement task; 

FIGURE 5 illustrates how management applica- 
tions are constructed in an object-oriented manner 
for use in the present invention: 
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FIGURE 6 illustrates the ORB/BOA object-invoca- 
tion mechanism used by the present invention; 
FIGURE 7 is a flowchart illustrating how a gateway 
machine invokes a method of an object running on 
an endpoint machine to facilitate a management 
task; 

FIGURE 8A is a flowchart illustrating how an end- 
point machine initiates a file pull system manage- 
ment operation; and 

FIGURE 88 is a flowchart illustrating how an end- 
point machine initiates an inventory system man- 
agement operation. 

DETAILED DESCRIPTION 



Referring now to FIGURE 1 . the invention is prefer- 
ably implemented in a large distributed computer envi- 
ronment 1 0 comprising up to thousands or even tens of 
thousands of "nodes." The nodes will typically be geo- 
20 graphically dispersed and the overall environment is 
"managed" in a distributed manner. Preferably, the man- 
aged environment (ME.)js logically broken down into a 
series of loosely-connected managed regions (MR) 12, 
each with its own server 14 for managing local resourc- 
es es with the MR. Multiple servers 1 4 coordinate activities 
across the enterprise and permit remote site manage- 
ment and operation. Each server 14 serves a number 
of gateway machines 16. each of which in turn support 
a plurality of endpoints 18. The server 14 coordinates 
30 all activity within the MR using a terminal node manager 
20. 

Referring now to FIGURE 2. each gateway machine 
16 runs a server component 22 of a system manage- 
ment framework. The server component 22 is multl- 
3S threaded runtime process that comprises several com- 
ponents: an object request broker or "ORB" 21 , an au- 
thorization service 23, object location service 25 and ba- 
sic object adaptor or "BOA" 27. Server component 22 
also includes an object library 29. Preferably, the ORB 
40 21 runs continuously, separate from the operating sys- 
tem, and it communicates with both sender and client 
processes through separate stubs and skeletons via an 
interprocess communication (IPC) facility 19. In partic- 
ular, a secure remote procedure call (RPC) is used to 
45 invoke operations on remote objects. Gateway ma- 
chines 16 also includes an operating system 15 and a 
threads mechanism 17. 

The system management framework includes a cli- 
ent component 24 supported on each of the endpoint 
so machines 1 8. The client component 24 is a low cost, low 
maintenance application suite that is preferably "data- 
less" in the sense that system management data is not 
cached or stored there in a persistent manner. Imple- 
mentation of the management framework in this "client- 
55 server" manner has significant advantages over the pri- 
~ or art. and it facilitates the connectivity of personal com- 
puters into the managed environment. Using an object- 
oriented approach, the system management framework 
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facilitates execution of system management tasks re- 
quired to manage the resources in the MR. Such tasks 
are quite varied and include, without limitation, file and 
data distribution, network usage monitoring, user man- 
agement, printer or other resource configuration man- 
agement, and the like. System management tasks in- 
volve system management "data', which generally is 
the information collected or distributed as part of a par- 
ticular system management task. 

In the large enterprise such as illustrated in FIGURE 
1 . there is one server per MR with some number of gate- 
ways. For a workgroup-size installation (e.g., a local ar- 
ea network or "LAN") such as illustrated in FIGURE 3. 
a single sen/er-class machine may be used as the serv- 
er and gateway, and the client machines would run the 
lost cost framework. References herein to a distinct 
server and one or more gateway(s) should thus not be 
taken by way of limitation as these elements may be 
combined into a single platform. For intermediate size 
installations! the MR grows breadth-wise, with addilion- 
al gateways then being used to balance the bad of the 
endpoints. 

The server is the top-level authority over all gateway and 
endpoints. The server maintains an endpoint list, which 
keeps track of every endpoint in a managed region. This 
list contains all information necessary to uniquely iden- 
tify and manage endpoints including, without limitation, 
such information as name, location, and machine type. 
The server also maintains the mapping between end- 
point and gateway, and this mapping is dynamic. Based 
on site-specific settings, it is possible to reassign end- 
points when gateways go down or to automatically add 
new endpoints as they appear on the network. 

As noted above, there are one or more gateways 
per managed region. A gateway is a full managed node 
that has been configured to operate as a gateway. Ini- 
tially, a gateway "knows" nothing about endpoints. As 
endpoints login (discussed below), the gateway builds 
an endpoint list for its endpoints^ The gateway's duties 
include: listening for endpoint login requests, listening 
for endpoint upcall requests, and (its main task) acting 
as a gateway for method invocations on endpoints. 

As also discussed above, the endpoint is a machine 
running the system management framework client com- 
ponent, which is referred to herein as the low costf rame- 
work (LGF). The LCF has two main parts as illustrated 
in FIGURE 2A: the Icf daemon 24a and an application 
runtime library 24b. The LCF daemon 24a is responsible 
for endpoint login and for spawning application endpoint 
executables. Once an executable is spawned, the LCF 
daemon 24a has no further interaction with it. Each ex- 
ecutable is linked with the application runtime library 
24b, which handles alt further communication with the 
gateway. 

Preferably, the server and each of the gateways is 
a computer or "machine.". For example, each computer 
may be a RISC System/6000« (a reduced instruction set 
or so-called RISC-based workstation) running the AIX 



(Advanced Interactive Executive) operating system, 
preferably Version 3.2.5 or greater The AIX operating 
system is compatible at the application interface level 
with the UNIX operating system, version 5.2. 
5 The various models of the RISC-based computers 
are described in many publications of the IBM Corpora- 
tion, for example. RISC Svstem/6000. 7073 and 7016 
POWERstation and POWERserver Hardware Technical 
Reference. Order No. SA23-2644-00. The AIX operat- 
10 ing system is described in AIX Operating System Tech- 
nical Reference, published by IBM Corporation, First 
Edition (November. 1 985). and other publications. A de- 
tailed description of the design of the UNIX operating 
system is found in a book by Maurice_J. Bach, Design 
IS of the Unix Operating Svstem. published by Prentice- 
Hall (1986). Suitable alternative machines include: an 
IBM-compatible PC 486 or higher running Novell Unix- 
Ware 2.0. an AT&T 3000 series running AT&TUNIX 
SVR4 MP-RAS Release 2.02 or greater. Data General 
20 AViiON series running DG/UX version 5.4R3.00 or 
greater, an HP9000/700 and 800 series running HP/UX 
9.00 through HP/UX 9.057-Motorola 88K series running 
SVR4 version R40V4.2. a Sun SPARC series running 
Solaris 2.3 or 2.4, or a Sun SPARC series running Su- 
25 nOS 4.1.2 or 4.1.3. Of course, other machines and/or 
operating systems may be used as well for the gateway 
and server machines. 

Each endpoint is also a computer. In one preferred 
embodiment of the invention, most of the endpoints are 
30 personal computers (e.g., desktop machines or lap- 
tops). In this architecture, the endpoints need not be 
high powered or complex machines or workstations. 
One or more of the endpoints may be a notebook com- 
puter, e.g., the IBM ThinkPad" machine, or some other 
35 Intel x86 or Pentium«-based computer running Win-, 
dows 3.1 or greater operating system. IBM« or IBM- 
compatible machines running under the OS/2« operat- 
ing system may also be implemented as the endpoints. 

For more information on the OS/2 operating system. the.. 

40 reader is directed to OS/2 2.0 Technical Library. Pro- 
gramming Guide Volumes 1 -3 Version 2.00, Order Nos. 
10G6261 . 10G6495 and 10G6494. 

As noted above, the server-class framework run- 
ning on each gateway machine is multi-threaded and is 
45 capable of maintaining hundreds of simultaneous net- 
work connections to remote nr>achines. A thread of ex- 
ecution may be a separate process (in the UNIX para- 
digm) or a separate thread in a single process (in the 
POSIX pthreads paradigm), P031X is a series of stand- 
so ards for applications and user interfaces to open sys- 
tems, issued by the Institute of Electrical and Electronics 
Engineers Inc. (IEEE). The IEEE POSIX.Ic is the 
emerging standard for user level multi -threaded pro- 
gramming and is implemented in the served component 
55 ^ of the systems management framework. All objects in 
this framework exhibit "state." This state may be com- 
pletely persistent, in which case it is represented by at- 
tributes in the object database associated with a gate- 
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way machine, or the state may be non-persistent An 
example o1 the latter might be the current list of ma- 
chines that are down. 

In the preferred embodiment, the LCF is the small- 
est amount of code that can still do useful endpoint work. 
Generally, this means that the LCF code has the follow- 
ing characteristics: single-threaded, limited cascade 
and upcall capability, no persistent attributes and only a 
small set of CORBA runtime. Machines that see little 
management activity in a typical day are those for which 
the LCF is particularly advantageous. Machines that 
need to service a large volume of requests preferably 
run the server component of the system management 
framework. 

The LCF is always ready to do management tasks, 
but consumes tew resources because it is normally in 
an idle state. Preferably, each endpoint is "dataless" in 
the sense that system management data is not stored 
therein before or after a particular system management 
task is implemented or carried out. Therefore, unlike the 
prior art: there is no need to perform system backup or 
other conventional maintenance on the endpoint (at 
least with respect to the system management frame- 
work). Maintenance on the sen/er and gateway ma- 
chines suffices. Similarly, now application functionality 
may be added to the Icf without rebuilding, reinstalling 
or re-initializing the client component into the system 
management framework. 

Since endpoints are dataless and readily extensi- 
ble, the architecture is easily scaleable as illustrated in 
FIGURE 1 . The client component of the system man- 
agement framework is inexpensive and requires little or 
no maintenance since much of the system management 
framework is off-loaded to the gateway machines that 
run the multi-threaded, sen/er component. This archi- 
tecture advantageously enables a rational partitioning 
of the enterprise with lO's of sen/ers, 100's of gateway 
machines, and 1000's of endpoints. Each sen/er typical- 
ly serves up to 200 gateways, each of which services 
1000's of endpoints. At the framework level, all opera- 
tions to or from an endpoint pass through a gateway ma- 
chine. In many operations, the gateway is transparent; 
it receives a request, determines the targets, resends 
the requests, waits tor results, then returns results back 
to the caller. Each gateway handles multiple simultane- 
ous requests, and there may be any number of gate- 
ways in an enterprise, with the exact number depending 
on many factors including the available resources and 
the number of endpoints that need to be serviced. 

Thus, according to the present invention, one or 
more of the endpoints 18 are "dataless." Except for the 
LCF executable itself, preferably there is no database 
or other state that must be kept on the endpoint. When- 
ever executable code and/or system management data 
are needed for a particular system management task, 
such code and/or data are delivered from a particular 
gateway machine to the affected endpoints machines 
automatically. Typically, the executable code and/or da- 



ta will remain valid for the particular task invocation; 
while the executable code may be cached, usually the 
system management data will not be. If the executable 
code is cached, preferably the system management 

5 framework may be configured to automatically resend 
such code without operator intervention if the cache is 
lost or periodically flushed. 

An endpoint Is added to the enterprise by first cop- 
ying the LCF daemon 24a to the endpoint's disk. This 

10 may be done autcimatically through network login 
scripts, manually by inserting a diskette, or by preload- 
ing the boot disk at the time of purchase or license. The 
first time the LCF daemon is installed, and on each sub- 
sequent boot, the LCF daemon attempts to login to its 

15 gateway. If the gateway is not known or if the gateway 
does not respond, the daemon issues a broadcast re- 
questing a gateway. For completely new endpoints the 
broadcast is ultimately forwarded to the server. If a gate- 
way hears a broadcast or a login request from an end- 

20 point it recognizes, the gateway services the request it- 
self. ~ 

When the server receives an endpoint's gateway re- 
quest broadcast: the server consults its endpoint list to 
see which gateway the endpoint belongs to. For new 

25 endpoints, or when migrating between gateways, the 
server uses a site specific policy to choose the correct 
gateway (e.g., by subnet). The gateway is informed of 
its new endpoint, the gateway informs the endpoint, and 
the login completes. 

30 An endpoint preferably communicates only with its 
gateway. Requiring all endpoint communication to pass 
through a single gateway greatly simplifies connectivity 
issues. After a successful login, both endpoint and gate- 
way know a working address by which to address one 

35 another. If a DHCP address lease expires, or anything 
changes in the network topology, then the next endpoint 
login will establish the new endpoint to gateway ad- 
dresses. 

There is no absolute maximum number of endpoints 

40 that can be supported by a single gateway. The design 
strategy is that the gateway is always in control of its 
own workload. The endpoints are not allowed to send 
data unless granted permission. When an endpoint has 
results to return, or if it wishes to make an upcall, It sends 

45 a very small message requesting service. The gateway 
queues the request and services the queue as time al- 
lows. When an endpoint has large results, it must break 
the results into chunks and may only send a chunk when 
instructed to do so. This stralegy makes it possible for 

50 a single gateway to support thousands of endpoints, al- 
beit somwhat slowly. If a better quality of service is de- 
sired, it is simply a matter of adding more gateways. 

Endpoint methods are normal CORBA methods (as 
discussed below) linked with IDL compiler generated 

55 code and the endpoint application runtime library 24b. 
~ This results in a native executable designed to be 
spawned by the LCF daemon 24a. Any number of meth- 
ods may be implemented in a single executable. 
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Preferably, an endpoint is installed without any 
methods. Method executables are downloaded from the 
gateway as required. When the LCF daemon receives 
a method invocation request, it checks the local disk 
cache. If there is a cache miss, or a version mismatch, 
then a new executable is downloaded. In this way an 
endpoint can start with nothing and then build a working 
set of methods for fast execution. 

Before illustrating how the framework is used, the 
following background is provided. FIGURE 4 first illus- 
trates how a systems management task is implemented. 
Each authorized administrator 30 has access to a desk- 
top computer 32 containing one or more Icons repre- 
senting system resources. As administrators interact 
with dialog screens and menus available from these 
icons, they are able to change system configurations 
and manage new resources in the distributed environ- 
ment, all in a known manner In particular, when admin- 
istrator 30 interacts with the desktop, so-called "call- 
backs" (i.e. software that responds to the user's actions) 
are invoked from the user interface on underlying ob- 
jects representing some system resource or compo- 
nent. These callbacks are translated into a series of 
method invocations that actually perfomr> the work and 
return results or status to the administrator In particular, 
and with reference to the process flow diagram of FIG- 
URE 4, the information flow begins when the adminis- 
trator 30 selects an icon or interacts with a dialog. The 
information is then sent to the desktop (which may be 
connected to a server or a gateway) at step 34. at which 
time the appropriate application callback method is in- 
voked at step 36. The callback method then invokes 
core application methods at step 39, which communi- 
cate with the application object(s) toperfomi some sys- 
tem management operation, as illustrated at step 39. 
Any return information or state is passed back at steps 
40 and 41 . If an update to the user interface is required, 
the desktop 32 interprets the output and updates the di- 

- - alogs on-the administrator/Es desktop at step 42. 

Preferably, the framework includes a task library 
that enables administrators to create 'shell" scripts that 
can run an any managed node of the enterprise envi- 
ronment. A shell script integrated with a managed node 
is called a "task." When administrators want to create a 
task, they provide a machine and a path to an executa- 
ble file. The executable can be a shell script, a special- 
ized script, a compiled program or any other kind of valid 
executable. When a task is created, the executable is 
stored as an attribute in an object database associated 
with a gateway machine. When the task is needed, the 
executable file is retrieved from the attribute and is pro- 
vided to one or more managed nodes. After a task is 
created, it is added to the task library and displayed as 
an icon. 

FIGURE 5 illustrates how systems management 
applications are constructed from a set of standard ob- 
ject types that together constitute a general applications 
architecture for the framework. These standard object 



types are closely interrelated and are used to build ob- 
jects for system management applications. The objects 
provided by a management application can be divided 
into one or nnore object types 44 that are managed by 

5 one or more instance managers 46. An object type is a 
set of objects that share a common interface. Each ob- 
ject of the same type is called an instance of the type. 
Within the systems management framework, object 
types are associated with a particular instance manager 

10 The instance managers are registered and stored in a 
library 48, which provides a central repository for system 
administration object infomnation within a managed re- 
gion. 

As referenced above, the systems management 
IS provides an implementation of a CORBA 1 . 1 Object Re- 
quest Broker (ORB), basic object adaptor (BOA), and 
related object services. CORBA 1 . 1 is a specification for 
an object-oriented distributed computer systems man- 
agement architecture provided by The Object Manage- 
20 menl Group (OMG), a non-profit association of more 
than 300 companies. CORBA describes the use of the 
Object Request Broker'(eRB) and basic object adaptor 
(BOA) that provide a mechanism for object invocation 
and return of results. The specification defines interfac- 
es es to a set of low-level object sen/ices and enables such 
services to be integrated in many different language and 
systems using object encapsulation, service requestor/ 
provider isolation, and interface and implementation 
separation. 

30 In a CORBA 1.1 implementation as seen in FIGURE 
6. there are three primary components: a client, an ob- 
ject implementation, and the ORB/BOA. The client 50 is 
the requestor of a service that is provided by an object 
implementation 52. The ORB 21 delivers the request 
35 from the client 50 to the object implementation 52 
through the BOA 27. The object implementation 52 then 
performs the requested service, and any retum data is 
delivered back to the client. The client and object imple- 

mentation are isolated from each other, an_d neither has 

40 any knowledge of the other except through their ORB/ 
BOA interfaces. Client requests are independent of the 
object implementation location and the programming 
language in which they are implemented. 

The ORB delivers the request to the BOA, which 
45 activates the process under which the object implemen- 
tation (e.g.. a server) runs. The BOA then invokes the 
method associated with the request by way of a sen/er 
skeleton 61 . When the method is finished, the BOA 
manages the termination of the method and coordinates 
so the return of any results to the "client. Alternatively, if a 
request is unknown until runtime, a Dynamic Invocation 
Interface (Dll) 55 is used to build a request used in place 
of a client stub 63 linked at compile time. 

With the above background, several examples of 
55_ how system management tasks are- carried out using 
the client-server framework are now described. The fol- 
lowing discussion is merely exemplary, as the nature 
and type of systems management tasks that can be car- 
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ried out by the framework is unconstrained. As noted 
above, the system management framework includes the 
server component supported on the gateway machines 
of the managed region, and the associated client com- 
ponent is supported on the endpoints serviced by the 
gateway(s). To provide transparent functionality, every 
method to be run on an endpoint is preferably configured 
on the gateway All.endpoint object references (Objrefs) 
that other methods use to invoke operations on an end- 
point are Objrefs and each endpoint is assigned to a 
unique object dispatcher number (odnum). The gateway 
machine then uses true object references to "point" to 
the actual endpoint(s). 

The routine illustrated in FIGURE 7 occurs when 
one method invokes a method of an object on an end- 
point. At step 70, the server 1 4 resolves the Objref/meth- 
od to a specific method daemon on a gateway. At step 
72, the gateway is started (if it is not already running) 
and passed a method activation record or so-called 
MAR block. The MAR block contains all information nec- 
essary to invoke a method including the target Objref, 
the target method, 'marshalled" input arguments and 
other fields from the server's method store. As noted 
above with respect to FIGURE 6, when a request to run 
an operation on some object is made, a client stub ini- 
tiates the request, collects the data associated with the 
request, and converts It from its current format to a com- 
mon format. This process is known as "data marshal- 
ling" and is performed in accordance with the ASN.1 
Standard. Up to this point, there is no difference as far 
as the application is concerned between this method in- 
vocation and any other method invocation. At step 74. 
the gateway looks at the Objref to determine the target 
endpoint. A connection to the endpoint is then opened 
at step 76. 

The method continues at step 78 and queries 
whether the endpoint has a copy of the executable for 
the method. If not, the routine continues at step 80 to 
retrieve the appropriate executable from the object da- 
tabase (based on the endpoint's machine type), and the 
executable is then sent to the endpoint. At step 82. or if 
the endpoint already had the necessary executable, the 
gateway sends the still-marshalled input arguments to 
the endpoint. At step 84, the gateway enters into a wait 
state, waiting for the marshalled output arguments to be 
returned. When this occurs, the routine continues at 
step 86 and closes the connection to the endpoint. The 
output arguments are then sent back to the server at 
step 88 and the routine terminates as though the method 
had executed locally. 

.. Endpoint- initiated operations can now be de- 
scribed. In these types of operations, the endpoint first 
attempts to establish a connection back to its gateway. 
Preferably, this is done in the same way that the end- 
point client attempts to identify itself upon boot. It first 
tries its last known gateway host; if it gets no response, 
the endpoint then contacts the server's TN Manager rou- 
tine either directly or, if that fails, via a BOOTP broad- 



cast. Once connected, the server component of the 
gateway is invoked and the endpoint passes a descrip- 
tor telling the gateway which endpoint has connected. 
Upcalls are method requests originating at the end- 
5 point. The Tivoli Courier"^ pull interface is an example 
of a program making upcalls. Not all applications will 
need upcalls. An endpoint makes an upcall by calling 
an IDL compiler generated stub. Unlike a normal meth- 
od invocation, an upcall always invokes a special upcall 
10 method on the gateway An application that supports up- 
calls must provide both the endpoint code and the spe- 
cial upcall method. The endpoint upcall facility is not a 
general purpose method invocation interface. This de- 
sign maintains scalability by handling upcalls at the local 
IS gateway without management server intervention. 

An illustrated "file pull" operatbn is shown in FIG- 
URE 8A. In this operation, the endpoint-resident code 
(i.e. the client component) initiates a file request, and 
the gateway-resident code (i.e. the server component) 
20 handles the request. Thejouline begins at step 96. with 
the user on the endpoint invoking a local program that 
is part of the client con^nent of a file distribution ap- 
plication. This step invokes a method asking for a list of 
all available filepacks. A "filepack" is a collection of files 
25 being distributed in accordance with a system manage- 
ment task. At step 98. the gateway receives the request, 
invokes the server component of the file distribution ap- 
plication, passing it the descriptor identifying the end- 
point. The software distribution program, running as a 
30 method on the gateway, issues a name service lookup 
at step 1 00, filters out filepacks that the endpoint is al- 
lowed to receive at step 102, and then returns the list to 
the endpoint at step 104. Depending on the actions of 
the user, the endpoint may then invoke another method 
3$ asking for a specific filepack. 

FIGURE 8B Illustrates an inventory reporting sys- 
tem management task. In this example, assume that an 
inventory application is designed to collect inventory in- 
formation (e.g.. the hardware and software configura- 
te tion details) from a machine whenever the machine is 
rebooted. It is also assumed that an initialization method 
has been run on each endpoint to edit the appropriate 
boot-time script to run the inventory report program on 
boot. The routine starts upon the next boot. In particular, 
45 at step 106. the program runs the local inventory report 
to produce a data file of inventory results. At step 108, 
the endpoint connects to the gateway in the manner pre- 
viously described. The endpoint then invokes an "inven- 
tory report" method at step 110 and, at step 112, sends 
so the results to the gateway. At sTep 1 1 4, the server com- 
ponent of the inventory report method is invoked and 
passed a descriptor identifying the endpoint. The inven- 
tory results are road by the gateway at step 116 and, at 
step 118, forwarded to a central inventory database ob- 
55 ^ ject. 

The LCF is designed with efficient distributions op- 
erations in mind. Each gateway is also a repeater for 
each of the gateway's endpoints. This relationship is im- 
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plicit so It does not need to be set in the repeater con- 
figuration. Distributions fan out from the gateway to the 
endpoints without further server support. Eliminating the 
server as a bottleneck makes distributions very scalea- 

b!e. 

As can be seen, the endpoint remains essentially 
dataless except as needed to carry out the particular 
system management task. This operation is facilitated 
by distributing the system management framework 
across client components, supported on the endpoints, 
and server components, supported on the gateways 
that service the endpoints. 

As noted above, the LCF is extensible and thus new 
application functionality is easily added to the system 
management framework without having to rebuild or re- 
install the client component at the endpoint. The specif- 
ics on how the LCF is installed and activated vary from 
OS type to OS type. On some operating systems there 
is a background task (e.g., inted on Unix) that listens for 
incoming network conneclbns. On other operating sys- 
tems, the Icf itself is configured as a background task 
that is dormant until a network request activates it. in all 
cases, a persistent network listener of the Icf is relatively 
small. The network listener will fork/spawn/invoke the 
larger application component (as the case may be) in 
response to a request from the client. 

According to the invention, the code running on 
each endpoint is a small subset of a object-oriented 
CORBA runtime. This subset has sufficient functionality 
to implement methods such as the filepack distribution 
methods and CCMS profile endpoint methods (dataless 
model). The basic components are ADR encoding rou- 
tines for argument marshaling, standalone IPC library, 
bdt (block data transfer) and iom. message catalogs for 
118N, memory management and exception handling. 

One of the preferred implementations of the client 
component of the system management framework is as 
a set of instructions in a code module resident in the 
Tanddm access'memory of the endpointr tJntil 
by the computer, the set of instructions may be stored 
in another computer memory, tor example, in a hard disk 
drive, or in a removable memory such as an optical disk 
(for eventual use in a CD ROM) or floppy disk (for even- 
tual use in a floppy disk drive), or even downloaded via 
the Internet. In addition, although the various methods 
described are conveniently implemented in a general 
purpose computer selectively activated or reconfigured 
by software, one of ordinary skill in the art would also 
recognize that such methods may be carried out in hard- 
ware, in firmware, or in more specialized apparatus con- 
structed to perform the required method steps. 

Further, although the invention has been described 
in terms of a preferred embodiment in a specific network 
environment, those skilled in the art will recognize that 
the invention can be practiced, with modification/in oth- 
er and different network architectures with the spirit and 
scope of the appended claims The present invention, 
however, is not to be construed as limited to the partic- 



ular managed architecture illustrated and thus in a more 
general sense the inventive use of a client-server man- 
aged framework should be broadly construed to be use- 
ful in any distributed network configuration. 



Claims 

1. A method of managing computing resources in a 
10 large distributed enterprise, comprising the steps 

of: 

organizing the computing resources into one or 
more managed regions, each region being 
IS nnanaged by a server machine servicing one or 

more gateway machines, each gateway ma- 
chine servicing a plurality of endpoint ma- 
chines: and 

20 delivering, on anas-needed basis, executable 

code and system nnanagement data from a 
gateway to one-or more endpoint machines 
serviced by the gateway to facilitate execution 
of at least one system management task aff ect- 

25 ing the managed region in the one or more end- 

point machines. 

2. The method as described in Claim 1 wherein data 
is used by the one or more endpoint machines to 

30 facilitate execution of the system management task 
and is not stored for subsequent use. 

3. The method as described in Claim 1 further includ- 
ing the step of caching the executable code for sub- 

35 sequent use. 

4. The method as described in Claim 1 further includ- 
ing the step of maintaining the computing resources 

by backing up system management data only from 

40 the server and the one or more gateway machines. 

5. The method as described in Claim 1 wherein at 
least some of the endpoint machines are personal 
computers. 

45 

6. The method as described in Claim 1 wherein each 
of the endpoint machines runs a client component 
of a system management framework and wherein 
the gateway machine that services the endpoint 

so machines runs a server component of the system 
management framework. 

7. The method as described in Claim 6 wherein the 
client component of the system management 

55 _ franhework has an operating state that is nontially 

idle. 

8. The method as described in Claim 6 wherein the 
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server component of the system management 
framework executes in a multi-threaded fashion. 



gateway machine and for spawning an executable 
downloaded from the gateway machine 



9. In a large distributed enterprise wherein computing 
resources are organized into one or more managed 
regions, each region being managed by a server 
machine servicing one or more gateway machines, 
each gateway machine servicing a plurality of end- 
point machines, a method of executing a system 
management task affecting a managed region, 
comprising the steps of: 

delivering executable code and system man- 
agement data required tor the system manage- 
ment task from a gateway machine to one or 
more endpoint machines serviced by the gate- 
way; and 

using a distributed system management f rame- 
i work supported on the gateway machine and 
the one or more endpoint machines to carry out 
the system management task using the execut- 
able code and system management data. 

10. In the enterprise as described in Claim 9 wherein 
the distributed system management framework in- 
cludes a client component supported on each of the 
one or more endpoint machines and a server com- 
ponent supported on the gateway machine that 
services the one or more endpoint machines. 
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20 



25 



30 



11. A computer connectable into a large distributed en- 
terprise having a server machine servicing a set of 
gateway machines, each of which services a set of 
endpoint machines, comprising: 



a processor; 

an operating system; and 

a client component of a system management 
framework, the client component having an as- 
sociated server component supported on a 
gateway machine; wherein the client compo- 
nent ot the system management framework in- 
cludes means responsive to receipt of execut- 
ables and system management data from the 
gateway machine to facilitate execution of a 
system management task affecting the man- 
aged region into which the computer is con- 
nected. 



40 



45 



so 



12. The computer as described in Claim 11 wherein the 
means includes a daemon and an application runt- 
ime library. . 

13. The computer as described in Claim 12 wherein the 
LCF daemon is responsible for endpoint login to a 
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